95 research outputs found

    Scoring lexical entailment with a supervised directional similarity network

    Get PDF
    Scoring Lexical Entailment with a Supervised Directional Similarity NetworkERC Nvidi

    On the relation between linguistic typology and (limitations of) multilingual language modeling

    Get PDF
    A key challenge in cross-lingual NLP is developing general language-independent architectures that are equally applicable to any language. However, this ambition is largely hampered by the variation in structural and semantic properties, i.e. the typological profiles of the world's languages. In this work, we analyse the implications of this variation on the language modeling (LM) task. We present a large-scale study of state-of-the art n-gram based and neural language models on 50 typologically diverse languages covering a wide variety of morphological systems. Operating in the full vocabulary LM setup focused on word-level prediction, we demonstrate that a coarse typology of morphological systems is predictive of absolute LM performance. Moreover, fine-grained typological features such as exponence, flexivity, fusion, and inflectional synthesis are borne out to be responsible for the proliferation of low-frequency phenomena which are organically difficult to model by statistical architectures, or for the meaning ambiguity of character n-grams. Our study strongly suggests that these features have to be taken into consideration during the construction of next-level language-agnostic LM architectures, capable of handling morphologically complex languages such as Tamil or Korean.ERC grant Lexica

    HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment

    Get PDF
    We introduce HyperLex — a dataset and evaluation resource that quantifies the extent of of the semantic category membership, that is, type-of relation also known as hyponymy–hypernymy or lexical entailment (LE) relation between 2,616 concept pairs. Cognitive psychology research has established that typicality and category/class membership are computed in human semantic memory as a gradual rather than binary relation. Nevertheless, most NLP research, and existing large-scale inventories of concept category membership (WordNet, DBPedia, etc.) treat category membership and LE as binary. To address this, we asked hundreds of native English speakers to indicate typicality and strength of category membership between a diverse range of concept pairs on a crowdsourcing platform. Our results confirm that category membership and LE are indeed more gradual than binary. We then compare these human judgments with the predictions of automatic systems, which reveals a huge gap between human performance and state-of-the-art LE, distributional and representation learning models, and substantial differences between the models themselves. We discuss a pathway for improving semantic models to overcome this discrepancy, and indicate future application areas for improved graded LE systems.This work is supported by the ERC Consolidator Grant (no 648909)

    Airborne observations of the Eyjafjalla volcano ash cloud over Europe during air space closure in April and May 2010

    Get PDF
    © Author(s) 2011. This work is distributed under the Creative Commons Attribution 3.0 LicenseAirborne lidar and in-situ measurements of aerosols and trace gases were performed in volcanic ash plumes over Europe between Southern Germany and Iceland with the Falcon aircraft during the eruption period of the Eyjafjalla1 volcano between 19 April and 18 May 2010. Flight planning and measurement analyses were supported by a refined Meteosat ash product and trajectory model analysis. The volcanic ash plume was observed with lidar directly over the volcano and up to a distance of 2700 km downwind, and up to 120 h plume ages. Aged ash layers were between a few 100 m to 3 km deep, occurred between 1 and 7 km altitude, and were typically 100 to 300 km wide. Particles collected by impactors had diameters up to 20 μm diameter, with size and age dependent composition. Ash mass concentrations were derived from optical particle spectrometers for a particle density of 2.6 g cm-3 and various values of the refractive index (RI, real part: 1.59; 3 values for the imaginary part: 0, 0.004 and 0.008). The mass concentrations, effective diameters and related optical properties were compared with ground-based lidar observations. Theoretical considerations of particle sedimentation constrain the particle diameters to those obtained for the lower RI values. The ash mass concentration results have an uncertainty of a factor of two. The maximum ash mass concentration encountered during the 17 flights with 34 ash plume penetrations was below 1 mg m-3. The Falcon flew in ash clouds up to about 0.8 mg m-3 for a few minutes and in an ash cloud with approximately 0.2 mg -3 mean-concentration for about one hour without engine damage. The ash plumes were rather dry and correlated with considerable CO and SO2 increases and O3 decreases. To first order, ash concentration and SO2 mixing ratio in the plumes decreased by a factor of two within less than a day. In fresh plumes, the SO2 and CO concentration increases were correlated with the ash mass concentration. The ash plumes were often visible slantwise as faint dark layers, even for concentrations below 0.1 mg m-3. The large abundance of volatile Aitken mode particles suggests previous nucleation of sulfuric acid droplets. The effective diameters range between 0.2 and 3 μm with considerable surface and volume contributions from the Aitken and coarse mode aerosol, respectively. The distal ash mass flux on 2 May was of the order of 500 (240-1600) kgs -1. The volcano induced about 10 (2.5-50) Tg of distal ash mass and about 3 (0.6-23) Tg of SO2 during the whole eruption period. The results of the Falcon flights were used to support the responsible agencies in their decisions concerning air traffic in the presence of volcanic ash.Peer reviewe

    Resonance Fluorescence Spectrum of a Trapped Ion Undergoing Quantum Jumps

    Full text link
    We experimentally investigate the resonance fluorescence spectrum of single 171Yb and 172Yb ions which are laser cooled to the Lamb-Dicke regime in a radiofrequency trap. While the fluorescence scattering of 172Yb is continuous, the 171Yb fluorescence is interrupted by quantum jumps because a nonvanishing rate of spontaneous transitions leads to electron shelving in the metastable hyperfine sublevel 2D3/2(F=2). The average duration of the resulting dark periods can be varied by changing the intensity of a repumping laser field. Optical heterodyne detection is employed to analyze the fluorescence spectrum near the Rayleigh elastic scattering peak. It is found that the stochastic modulation of the fluorescence emission by quantum jumps gives rise to a Lorentzian component in the fluorescence spectrum, and that the linewidth of this component varies according to the average duration of the dark fluorescence periods. The experimental observations are in quantitative agreement with theoretical predictions.Comment: 14 pages including 4 figures, pdf file, fig.1 replace
    corecore